squ96I 
haelll 
asul 
sau96I 
nlalV 
hglJIl 
ecoD109I 
bspl286 
ban 1 1 
asul 
nlalV 

mil aval apal nnll 

nlooll nnll ecoD109I "Hhllll 

1 GGA CTT GTC TTC CTC GTC CTG CTG TTC CTC GGG GCC CTC GGA CTG 
-18 Gly Leu Val Phe Leu Val Leu Leu Phe Leu Gly Ala Leu Gly Leu 

haelll 

eael hlnPI 
cfrl W™1 
46 TGT CTG GCT GGC CGT AGG AGA AGG AGT GTT CAG TGG TGC GCC GTA TCC 
-3 Cys Leu Ala Gly Arg Arg Arg Arg Ser Val Gin Trp Cys Ala Val Ser 

haelll 
mil 
aval hael 

94 CAA CCC GAG GCC ACA AAA TGC TTC CAA TGG CAA AGG AAT ATG AGA AAA 
14 Gin Pro Glu Ala Thr Lys Cys Phe Gin Trp Gin Arg Asn Met Arg Lys 

nnll fnu4Hl 
sau96I bbvl plel 

haelll alul hlnfl bsrl 

asul pvull bsnal fokl 

142 GTG CTG GGC CCT CCT GTC AGC TGC ATA AAG AGA GAC TCC CCC ATC CAG 
30 Val Arg Gly Pro Pro Val Ser Cys lie Lys Arg Asp Ser Pro lie Gin 

haelll 
hael 

scrFl haelll 

ecoRH sau96I 

bs-tNl asul sfaNl 

190 TGT ATC CAG GCC ATT GCG GAA AAC AGG GCC GAT GCT GTG ACC CTT GAT 

46 Cys He Gin Ala He Ala Glu Asn Arg Ala Asp Ala Val Thr Leu Asp 

sau96I 
nlalV 
scrFl 
ecoRll 
bs-tNl 
haelll 
s-tul haelll 
nnll hael asul 
238 GGT GGT TTC ATA TAC GAG GCA GGC CTG GCC CCC TAC AAA CTG CGA CCT 
62 Gly Gly Phe lie Tyr Glu Ala Gly Leu Ala Pro Tyr Lys Leu Arg Pro 



FIG.-1A 



sau96I 

avail 

asul 

fnu4M accl nlalV 
286 GTA GCG GCG GAA GTC TAC GGG ACC GAA AGA CAG CCA CGA ACT CAC TAT 
78 Val Ala Ala Glu Val Tyr Gly Thr Glu Arg Gin Pro Arg Thr His Tyr 



fnu4HI 

nboll bbvl alul 

hphl fnu4HI alul pvull 

334 TAT GCC GTG GCT GTG GTG AAG AAG GGC GGC AGC TTT CAG CTG AAC GAA 

94 Tyr Ala Val Ala Val Val Lys Lys Gly Gly Ser Phe Gin Leu Asn Glu 

haelll sau96I 
stul avail 
bgll had asul fokl 

382 CTG CAA GGT CTG AAG TCC TGC CAC ACA GGC CTT CGC AGG ACC GCT GGA 

110 Leu Gin Gly Leu Lys Ser Cys His Thr Gly Leu Arg Arg Thr Ala Gly 

sau961 
avail 
asul 
nlalV 

430 TGG AAT GTC CCT ACA GGG ACA CTT CGT CCA TTC TTG AAT TGG ACG GGT 
126 Trp Asn Val Pro Thr Gly Thr Leu Arg Pro Phe Leu Asn Trp Thr Gly 

hgUIl alul 
bspl286 fnu4Hl 

ban II bbvl ddel alul 

ddel mil pvull nboll pvull 

478 CCA CCT GAG CCC ATT GAG GCA GCT GTG CAG TTC TTC TCA GCC AGC TGT 

142 Pro Pro Glu Pro He Glu Ala Ala Val Gin Phe Phe Ser Ala Ser Cys 



nspl 

hpall 
scrFI 
ncll 
caull 

526 GTT CCC GGT GCA GAT AAA GGA CAG TTC CCC AAC CTG TGT CGC CTG TGT 
158 Val Pro Gin Ala Asp Lys Gly Gin Phe Pro Asn Leu Cys Arg Leu Cys 

nlalV 
scrFI 
ecoRII 
i-mtl bs-tNI rsal 
574 GCG GGG ACA GGG GAA AAC AAA TGT GCC TTC TCC TCC CAG GAA CCG TAC 
174 Ala Gly Thr Gly Glu Asn Lys Cys Ala Phe Ser Ser Gin Glu Pro Tyr 
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nloIV 
hglCI 

alul ban I ddel bsnal bsnal 

622 TTC AGC TAC TCT GGT GCC TTC AAG TGT CTG AGA GAC GGG GCT GGA GAC 
190 Phe Ser Tyr Ser Gly Ala Phe Lys Cys Leu Arg Asp Gly Ala Gly Asp 

sau96I 
avail 
asul 
ppuMI 

hglAI ecoD109I 
bspl2B6 nnll nnll 

670 GTG GCT TTT ATC AGA GAG AGC ACA GTG TTT GAG GAC CTG TCA GAC GAG 
206 Val Ala Phe He Arg Glu Ser Thr Val Phe Glu Asp Leu Ser Asp Glu 

718 GCT GAA AGG GAC GAG TAT GAG TTA CTC TGC CCA GAC AAC ACT CGG AAG 
222 Ala Glu Arg Asp Glu Tyr Glu Leu Leu Cys Pro Asp Asn Thr Arg Lys 

scrFI 

ncll 

nspl 

hp all 

caul I 
xnal sau96I 
snal nlalV 
scrFI 

ncll avail 
caull 
aval asul 
sau96I ppuMI 
haelll nlalV 

bsrl asul ecoD109I nlalll 

766 CCA GTG GAC AAG TTC AAA GAC TGC CAT CTG GCC CGG GTC CCT TCT CAT 
238 Pro Val Asp Lys Phe Lys Asp Cys His Leu Ala Arg Val Pro Ser His 

sfaNI 

fokl nboll 
bgll dralll mnll hlnfl 

814 GCC GTT GTG GCA CGA AGT GTG AAT GGC AAG GAG GAT GCC ATC TGG AAT 
254 Ala Val Val Ala Arg Ser Val Asn Gly Lys Glu Asp Ala lie Trp Asn 

scrFI 
ecoRII 

bs-fcNl hphl 
862 CTT CTC CGC CAG GCA CAG GAA AAG TTT GGA AAG GAC AAG TCA CCG AAA 
270 Leu Leu Arg Gin Ala Gin Glu Lys Phe Gly Lys Asp Lys Ser Pro Lys 
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sau3A] 
nbol 
dpnl 
xholl 

alul tas-tYI 
bs-tXI ntalV bglll 

910 TTC CAG CTC TTT GGC TCC CCT AGT GGG CAG AAA GAT CTG CTG TTC AAG 
286 Phe Gin Leu Phe Gty Ser Pro Ser Gly Gin Lys Asp Leu Leu Phe Lys 

nlalV 
hglCI 

plel nnll bspl286 nnll 

hlnfl -taql ban! aval hlnfl 

958 GAC TCT GCC ATT GGG TTT TCG AGG GTG CCC CCG AGG ATA GAT TCT GGG 
302 Asp Ser Ala He Gly Phe Ser Arg Vol Pro Pro Arg lie Asp Ser Gly 

nspl 

s"tyl hpall 
rsal nlalV fokl nnll 

1006 CTG TAC CTT GGC TCC GGC TAC TTC ACT GCC ATC CAG AAC TTG AGG AAA 
318 Leu Tyr Leu Gly Ser Gly Tyr Phe Thr Ala He Gin Asn Leu Arg Lys 

nspl 

hpall ihal 
scrFl fnuDU 
ncll bs-tUI 
nnll fnu4Hl hlnPI 

mil bbvl caull hhal 

1054 AGT GAG GAG GAA GTG GCT GCC CGG CGT GCG CGG GTC GTG TGG TGT GCG 
344 Ser Glu Glu Glu Vol Ala Ala Arg Arg Ala Arg Val Val Trp Cys Ala 

hlnPI 
ns-tl 
fspl 
fnu4Hl 

alul hhal bs-tXI 
alwNl bbvl bsrl 
1102 GTG GGC GAG CAG GAG CTG CGC AAG TGT AAC CAG TGG AGT GGC TTG AGC 
350 Val Gly Glu Gin Glu Leu Arg Lys Cys Asn Gin Trp Ser Gly Leu Ser 

fnu4HI mil 

bbvl bspMI mil haell] mil sfaNl 

1150 GAA GGC AGC GTG ACC TGC TCC TCG GCC TCC ACC ACA GAG GAC TGC ATC 

366 Glu Gly Ser Val Thr Cys Ser Ser Ala Ser Thr Thr Glu Asp Cys He 

scrFl 

ecoRll bs-tXI 
bs-tNl alul sfaNI nlalll fokl nnll 

1198 GCC CTG GTG CTG AAA GGA GAA GCT GAT GCC ATG AGT TTG GAT GGA GGA 
382 Ala Leu Val Leu Lys Gly Glu Ala Asp Ala Me"t Ser Leu Asp Gly Gly 
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n lalll nlalV scrFl 

sphl hglCI ecoRIl 

rsal nspCIx banl bstNI 

1246 TAT GTG TAC ACT GCA TGC AAA TGT GGT TTG GTG CCT GTC CTG GCA GAG 

398 Tyr Val Tyr Thr Ala Cys Lys Cys Gly Leu Val Pro Val Leu Ala Glu 

sau3Al 
nbol 
dpn I 
alwl 

1294 AAC TAC AAA TCC CAA CAA AGC AGT GAC CCT GAT CCT AAC TGT GTG GAT 
414 Asn Tyr Lys Ser Gin Gin Ser Ser Asp Pro Asp Pro Asn Cys Val Asp 

sau3Al 
nbol 

ecoNI ecoRV dpn I 

1342 AGA CCT GTG GAA GGA TAT CTT GCT GTG GCG GTG GTT AGG AGA TCA GAC 
430 Arg Pro Val Glu Gly Tyr Leu Ala Val Ala Val Val Arg Arg Ser Asp 

scrFI 

ecoRIl 

bs-tNI 

1390 ACT AGC CTT ACC TGG AAC TCT GTG AAA GGC AAG AAG TCC TGC CAC ACC 
446 Thr Ser Leu Thr Trp Asn Ser Val Lys Gly Lys Lys Ser Cys His Thr 

haelll 

n lalll 

s~tyl sau96I nbol I 

psil ncol asul earl 

1438 GCC GTG GAC AGG ACT GCA GGC TGG AAT ATC CCC ATG GGC CTG CTC TTC 

462 Ala Val Asp Arg Thr Ala Gly TrP Asn lie Pro Me-t Gly Leu Leu Phe 

nlalV 
hglJII 
bspl286 

ban II sspl alul bspl286 

1486 AAC CAG ACG GGC TCC TGC AAA TTT GAT GAA TAT TTC AGT CAA AGC TGT 
478 Asn Gin Thr Gly Ser Cys Lys Phe Asp Glu Tyr Phe Ser Gin Ser Cys 

sau3AI 

nbol 

Dpnl 

scrFI xholl 
ecoRIl bs-tYI hglAl 

bs-tNI aval bglll bspl286 

1534 GCC CCT GGG TCT GAC CCG AGA TCT AAT CTC TGT GCT CTG TGT ATT GGC 
484 Ala Pro Gly Ser Asp Pro Arg Ser Asn Leu Cys Ala Leu Cys lie Gly 
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hphl bspl286 
1582 GAC GAG CAG GGT GAG AAT AAG TGC GTG CCC AAC AGC AAT GAG AGA TAC 
510 Asp Glu Gin Gly Glu Asn Lys Cys Val Pro Asn Ser Asn Glu Arg Tyr 

nlalV 
hglCI 

banl s cr Fl 
nspl ecoRll 

hpall bs-tNl ddel bsnl bsnal 
TTC CGG TGC CTG GCT GAG AAT GCT GGA GAC 











bsrl 


1630 


TAC 


GGC 


TAC 


ACT GGG GCT 


526 


Tyr 


Gly 


Tyr 


Thr Gly Ala 


1678 


GTT 


GCA 


TTT 


GTG AAA GAT 


542 


Val 


Ala 


Phe 


Val Lys Asp 



fnu4Hl 
bbvl 
hlnPl 

nnll nlalll ddel alul hhal 

1726 AAC AAT GAG GCA TGG GCT AAG GAT TTG AAG CTG GCA GAC TTT GCG CTG 
558 Asn Asn Glu Ala Trp Ala Lys Asp Leu Lys Leu Ala Asp Phe Ala Leu 

taql fnu4Hl 

nnll nnl1 bbvI 

bgll ddel alul 

1774 CTG TGC CTC GAT GGC AAA CGG AAG CCT GTG ACT GAG GCT AGA AGC TGC 

574 Leu Cys Leu Asp Gly Lys Arg Lys Pro Val Thr Glu Ala Arg Ser Cys 

sau96I 
nlalV 
nlalll 
sty I haelll 

ncol asul hlnfl nlalll bsnal fokl 

1822 CAT CTT GCC ATG GCC CCG AAT CAT GCC GTG GTG TCT CGG ATG GAT AAG 
590 His Leu Ala Met Ala Pro Asn His Ala Val Val Ser Arg Met Asp Lys 

fnu4Hl 
ecoNI alwNl bbvl 
1870 GTG GAA CGC CTG AAA CAG GTG CTG CTC CAC CAA CAG GCT AAA TTT GGG 
606 Val Glu Arg Leu Lys Gin Val Leu Leu His Gin Gin Ala Lys Phe Gly 

sau3AI 

mbol mspl 

dpnl hpall 
xholl scrFl 
bs-tYI ncll 

alwl caull bsrl 

1919 AGA AAT GGA TCT GAC TGC CCG GAC AAG TTT TGC TTA TTC CAG TCT GAA 
622 Arg Asn Gly Ser Asp Cys Pro Asp Lys Phe Cys Leu Phe Gin Ser Glu 
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haelll 
hQel 

eael s"tyl 
ddel cfrl plel ncol 

dralll boll hinfl 

1966 ACC AAA AAC CTT CTG TTC A AT GAC AAC ACT GAG TGT CTG GCC AGA CTC 
638 Thr Lys Asn Leu Leu Phe Asn Asp Asn Thr Glu Cys Leu Ala Arg Leu 

sau96I 

avail 

asul 

nlalll ndel sspl nlalV 

2014 CAT GGC AAA ACA ACA TAT GAA AAA TAT TTG GGA CCA CAG TAT GTC GCA 
654 His Gly Lys Thr Thr Tyr Glu Lys Tyr Leu Gly Pro Gin Tyr Val Ala 

scrPI 
ecoRIl 

hglAI bs-tNl 
bspl286 nnll nnll 
2062 GGC ATT ACT AAT CGT AAA AAG TGC TCA ACC TCC CCC CTC CTG GAA GCC 
670 Gly lie Thr Asn Leu Lys Lys Cys Ser Thr Ser Pro Leu Leu Glu Ala 

ddel 
ns-tll 

nnll sau96I 
eco811 nboll haelll 

ecoRl bsu36I nboll asul alul 

2110 TGT GAA TTC CTC AGG AAG TAA AACCGAAGAA GATGGCCCAG CTCCCCAAGA 
685 Cys Glu Phe Leu Arg Lys DC* 

s-tyl 
haelll 
sau96I 

nboll scrFl asul 
ddel earl ecoRIl nlalV 

nn H alul bs-tNl ecoD109I nlalV 

2161 AAGCCTCAGC CATTCACTGC CCCCAGCTCT TCTCCCCAGG TGTGTTGGGG CCTTGGCTCC 

ecoNl fokl ddeI 

2221 CCTGCTGAAG GTGGGGATTG CCCATCCATC TGCTTACAAT TCCCTGCTGT CGTCTTAGCA 

2281 AGAAGTAAAA TGAGAAATTT TGTTGATATT CAAAAAAAA 



>LENGTH< 2319 
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nlalll nlalV scrFl 

sphl hglCI ecoRIl 

rsal nspCIx ban I tasiNI 

1246 TAT GTG TAC ACT GCA TGC AAA TGT GGT TTG GTG CCT GTC CTG GCA GAG 

398 Tyr Val Tyr Thr Ala Cys Lys Cys Gly Leu Val Pro Val Leu Ala Gtu 

sau3Al 
nbol 

dpnl - - % 

alwl H 
1294 AAC TAC AAA TCC CAA CAA AGC AGT GAC CCT GAT CCT AAC T GT c.GT,G GAT/ 
414 Asn Tyr Lys Ser Gin Gin Ser Ser Asp Pro Asp Pro Asn Cys . Val -Asp 

sau3AI 
nbol 

ecoNI ecoRV dpnl 

1342 AGA CCT GTG GAA GGA TAT CTT GCT GTG GCG GTG GTT AGG AGA TCA GAC 
430 Arg Pro Val Glu Gly Tyr Leu Ala Val Ala Val Val Arg Arg Ser Asp 

scrFI 

ecoRIl 

bs-tNI 

1390 ACT AGC CTT ACC TGG AAC TCT GTG AAA GGC AAG AAG TCC TGC CAC ACC 
446 Thr Ser Leu Thr Trp Asn Ser Val Lys Gly Lys Lys Ser Cys His Thr 

haelll 

nlalll 
siyl sau96I mboll 
pstl ncol asul earl 

1438 GCC GTG GAC AGG ACT GCA GGC TGG AAT ATC CCC ATG GGC CTG CTC TTC 
462 Ala Val Asp Arg Thr Ala Gly TrP Asn He Pro Me* Gly Leu Leu Phe 

nlalV 
hgiJII 
bspl286 

ban II sspl alul bspl286 

1486 AAC CAG ACG GGC TCC TGC AAA TTT GAT GAA TAT TTC AGT CAA AGC TGT 
478 Asn Gin Thr Gly Ser Cys Lys Phe Asp Glu Tyr Phe Ser Gin Ser Cys 

sau3Al 

nbol 

Dpnl 

scrFI xholl 
ecoRIl bs-tYl hgiAI 

bs-tNI oval tag 1 1 1 bspl286 

1534 GCC CCT GGG TCT GAC CCG AGA TCT AAT CTC TGT GCT CTG TGT ATT GGC 
484 Ala Pro Gly Ser Asp Pro Arg Ser Asn Leu Cys Ala Leu Cys lie Gly 
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